Visual Comparison of Datasets using Mixture Decompositions
نویسندگان
چکیده
This article describes how a mixture of two densities, f0 and f1, may be decomposed into a different mixture consisting of three densities. These new densities, f+, f−, and f=, summarize differences between f0 and f1: f+ is high in areas of excess of f1 compared to f0; f− represents deficiency of f1 compared to f0 in the same way; f= represents commonality between f1 and f0. The supports of f+ and f− are disjoint. This decomposition of the mixture of f0 and f1 is similar to the set-theoretic decomposition of the union of two sets A and B into the disjoint sets A\B, B\A, and A ∩ B. Sample points from f0 and f1 can be assigned to one of these three densities, allowing the differences between f0 and f1 to be visualized in a single plot, a visual hypothesis test of whether f0 is equal to f1. We describe two similar such decompositions and contrast their behavior under the null hypothesis f0 = f1, giving some insight into how such plots may be interpreted. We present two examples of uses of these methods: visualization of departures from independence, and of a two-class classification problem. Other potential applications are discussed.
منابع مشابه
Spatio-temporal variability of aerosol characteristics in Iran using remotely sensed datasets
The present study is the first attempt to examine temporal and spatial characteristics of aerosol properties and classify their modes over Iran. The data used in this study include the records of Aerosol Optical Depth (AOD) and Angstrom Exponent (AE) from MODerate Resolution Imaging Spectroradiometer (MODIS) and Aerosol Index (AI) from the Ozone Monitoring Instrument (OMI), obtained from 2005 t...
متن کاملRepresenting Spectral data using LabPQR color space in comparison to PCA method
In many applications of color technology such as spectral color reproduction it is of interest to represent the spectral data with lower dimensions than spectral space’s dimensions. It is more than half of a century that Principal Component Analysis PCA method has been applied to find the number of independent basis vectors of spectral dataset and representing spectral reflectance with lower di...
متن کاملCharacter Decompositions
Several decomposition formats exist for image data: principal components, independent components, non-negative components, etc. These decompositions have been applied mainly to natural image data. In this paper, we study the above decompositions for hand-written devanagari character data. We propose a new measure – spatial entropy – for characterizing datasets. Datasets with high spatial entrop...
متن کاملSpatio-temporal variability of aerosol characteristics in Iran using remotely sensed datasets
The present study is the first attempt to examine temporal and spatial characteristics of aerosol properties and classify their modes over Iran. The data used in this study include the records of Aerosol Optical Depth (AOD) and Angstrom Exponent (AE) from MODerate Resolution Imaging Spectroradiometer (MODIS) and Aerosol Index (AI) from the Ozone Monitoring Instrument (OMI), obtained from 2005 t...
متن کاملVisualization of cracks by using the local Voronoi decompositions and distributed software
The paper presents a novel visualization technique for cracks propagating in monodispersed particulate material. The proposed technique is based on local space decompositions generated in fractured areas. The contact surfaces of the neighbouring particles are defined by the local Voronoi decomposition generated according to the lattice topology employed in computations of the discrete element m...
متن کامل